The Importance of Non-Data-Communication Overheads in MPI

نویسندگان

  • Pavan Balaji
  • Anthony Chan
  • William Gropp
  • Rajeev Thakur
  • Ewing L. Lusk
چکیده

With processor speeds no longer doubling every 18-24 months owing to the exponential increase in power consumption and heat dissipation, modern HEC systems tend to rely lesser on the performance of single processing units. Instead, they rely on achieving high-performance by using the parallelism of a massive number of low-frequency/low-power processing cores. Using such low-frequency cores, however, puts a premium on end-host preand post-communication processing required within communication stacks, such as the message passing interface (MPI) implementation. Similarly, small amounts of serialization within the communication stack that were acceptable on small/medium systems can be brutal on massively parallel systems. Thus, in this paper, we study the different non-data-communication overheads within the MPI implementation on the IBM Blue Gene/P system. Specifically, we analyze various aspects of MPI, including the MPI stack overhead itself, overhead of allocating and queueing requests, queue searches within the MPI stack, multi-request operations and various others. Our experiments, that scale up to 131,072 cores of the largest Blue Gene/P system in the world (80% of the total system size), reveal several insights into overheads in the MPI stack, which were previously not considered significant, but can have a substantial impact on such massive systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-data-communication Overheads in MPI: Analysis on Blue Gene/P

Modern HEC systems, such as Blue Gene/P, rely on achieving high-performance by using the parallelism of a massive number of low-frequency/low-power processing cores. This means that the local preand post-communication processing required by the MPI stack might not be very fast, owing to the slow processing cores. Similarly, small amounts of serialization within the MPI stack that were acceptabl...

متن کامل

Architecture Specific Communication Optimizations for Structured Adaptive Mesh-refinement Applications

OF THE THESIS Architecture Specific Communication Optimizations for Structured Adaptive Mesh-Refinement Applications by Taher Saif Thesis Director: Professor Manish Parashar Dynamic Structured Adaptive Mesh Refinement (SAMR) techniques for solving partial differential equations provide a means for concentrating computational effort to appropriate regions in the computational domain. Parallel im...

متن کامل

Handling Datatypes in MPI-3 One Sided

With the rise of modern interconnects offering direct remote memory access (RDMA), one sided programming becomes increasingly popular. The MPI-3.0 one sided chapter specifies a programming interface that leverages these features directly. foMPI (fast one-sided MPI, [3]) was introduced as a reference implementation to provide highest performance and minimal overheads. Many applications have to c...

متن کامل

Understanding the Behavior and Performance of Non-blocking Communications in MPI

The behavior and performance of MPI non-blocking message passing operations are sensitive to implementation specifics as they are heavily dependant on available system level buffers. In this paper we investigate the behavior of non-blocking communication primitives provided by popular MPI implementations and propose strategies for these primitives than can reduce processor synchronization overh...

متن کامل

GSLHA: Group-based Secure Lightweight Handover Authentication Protocol for M2M Communication

Machine to machine (M2M) communication, which is also known as machine type communication (MTC), is one of the most fascinating parts of mobile communication technology and also an important practical application of the Internet of Things. The main objective of this type of communication, is handling massive heterogeneous devices with low network overheads and high security guarantees. Hence, v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJHPCA

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2010